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We study Markov bases of decomposable graphical models consisting of primitive moves (i.e., 
square-free moves of degree two) by determining the structure of fibers of sample size two. 
We show that the number of elements of fibers of sample size two are powers of two and 
we characterize primitive moves in Markov bases in terms of connected components of induced 
subgraphs of the independence graph of a hierarchical model. This allows us to derive a complete 
description of minimal Markov bases and minimal invariant Markov bases for decomposable 
models. 
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1. Introduction 

Since Sturmfcls (1996) and Diaconis and Sturmfcls (1998) introduced the Markov chain 
Monte Carlo approach based on a Markov basis for testing the goodness of fit of statis- 
tical models of multiway contingency tables, many researchers have showed the useful- 
ness of the approach and studied Markov bases for various kinds of statistical models 
in computational algebraic statistics (e.g., Ho§ten and Sullivant (2002); Dobra (2003); 
Dobra and Sullivant (2004); Gciger et al. (2006)). Hierarchical models are of basic im- 
portance for statistical analysis of multiway contingency tables (e.g., Lauritzen (1996); 
Agresti (2002)). As illustrated in Aoki and Takemura (2003), however, the structure of 
Markov bases for hierarchical models is very complicated in general. Decomposable mod- 
els defined in terms of chordal graphs are particularly useful submodels of hierarchical 
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models. They are known to possess Markov bases consisting of primitive moves, that is, 
square- free moves of degree two (Dobra (2003); Hostcn and Sullivant (2002); Geigcr et al. 
(2006)). Dobra (2003) provided an algorithm to generate moves in such Markov bases 
based on a clique tree of the chordal graph defining the model. 

The main purpose of this paper is to clarify structures of Markov bases consisting 
of primitive moves for decomposable models. As shown in Takemura and Aoki (2004), 
Markov bases for general models can be constructed by combining moves of increas- 
ing degrees. This fact indicates the importance of studying the structure of primitive 
moves in order to clarify the structure of Markov bases for more general hierarchical 
models. Some practical models such as subtable sum models (Hara et al. (2009)) and 
quasi-independence models for incomplete contingency tables that contain some struc- 
tural zeros (Aoki and Takemura (2005); Rapallo (2006)) are obtained by imposing some 
constraints on a decomposable model. Rasch models (e.g., Chen and Small (2005)) and 
many-facet Rasch models (e.g., Zhu et al. (1998); Basturk (2008)) that are commonly 
used in psychomctrics and bchaviormctrics arc considered as decomposable models re- 
stricted to contingency tables in which cell frequencies are zeros or ones. From a practical 
viewpoint, detailed properties of Markov bases for decomposable models may also give 
insights into Markov bases for such models. 

The present authors have been studying Markov bases from the viewpoint of 
minimality (Aoki and Takemura (2003); Takemura and Aoki (2004)) and invariance 
(Aoki and Takemura (2008a, 2008b)) for some specific hierarchical models. The notions 
of minimality and invariance of Markov bases are important because they give concise 
expressions of the Markov basis. In this paper we extend the results to decomposable 
models. 

The set of contingency tables sharing the same marginal frequencies corresponding 
to the generating set of the model is called a fiber. The structure of primitive moves is 
equivalent to that of fibers of sample size two. Wc study the structure of fibers of sample 
size two in detail and give a complete description of minimal Markov bases and minimal 
invariant Markov bases for decomposable models. We also show that construction of a 
minimal invariant Markov basis is directly related to a basis of a vector space over the 
finite field GF(2). We describe under what conditions Dobra's Markov basis is minimal 
or minimal invariant. We also give a necessary and sufficient condition for the uniqueness 
of the minimal Markov basis for decomposable models. 

The organization of the paper is as follows. In Section 2 we set up notation for this 
paper and summarize preliminary results. In Section 3 we clarify structures of fibers of 
sample size two. Using this characterization, in Section 4 we give a complete description 
of minimal Markov bases and minimal invariant Markov bases for decomposable models. 
In Section 5 we briefly discuss reduced Grobner bases for decomposable models and we 
end the paper with some concluding remarks in Section 6. 

2. Preliminaries 

We mostly follow the notation in Lauritzen (1996); Ho§tcn and Sullivant (2002); Dobra 
(2003) for multiway contingency tables. Let A = {1, . . . , m} denote the set of variables of 
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an m-way contingency table. Let Is, 6 6 A, denote the number of levels of the variable 
5. For convenience we take the set of levels of the variable 5 as Is = {0, 1, . . . ,Ig — 1} 
starting from as in Ho§ten and Sullivant (2002). The cells of the contingency tabic are 
indexed by 

i = (ii,...,i m ) 61= Y\. Is 
<5eA 

n(i) denotes the frequency of the cell i and n = {n(i)}igi denotes an m-way contingency 
table. The set of positive cells supp(n) = {i G I \ n(i) > 0} is the support of n. 

For a subset D C A of the variables, the /^-marginal ri£> of n is the contin- 
gency table with marginal cells ijj G Id = YlseD-^ s ana - entries given by njj(ir)) = 
Si c ei c n(*D,'flc)- Here we are denoting i= (irj,i D c) by appropriately reordering 
indices. In this paper for notational simplicity, appropriate reordering of indices is per- 
formed as needed. 

Now we consider the existence of a table n with the marginal tables , . . . , iiD r . 
Dobra (2003) defined that the marginal tables , . . . , no r are consistent if, for any r\ , 
r2, the (D ri (~1 D r2 )-marginal of no ri is equal to the (D ri H LV 2 )-marginal of n^ r2 ■ The 
consistency of the marginal tables is obviously a necessary condition for the existence of 
n. However we note that it does not necessarily guarantee the existence of n in general 
(e.g., Irving and Jerrum (1994); Vlach (1986)). 

Let V = {Di, . . . , D r } be the set of facets of a simplicial complex such that A = 
{J~j =1 Dj- Then T> is called a generating class. Let p(i) denote the cell probability for 
i. Then the hierarchical model for a generating class T> is written as 

logp(i) = ^D(i), 
Dev 

where fin depends only on ijj. 

Let Q v be a graph with the vertex set A and an edge between 6,5' £ A if and 
only if there exists Del) such that 6,5' £ D. Q v is called an independence graph 
of V (Dobra and Sullivant (2004)). A hierarchical model for T> is called graphical if 
V = {Di, . . . ,D r } is the set of (maximal) cliques of Q v . By a clique we mean the set of 
vertices of a maximal complete induced subgraph. A graphical model is called decom- 
posable if (j is chordal, that is, every cycle of Q v with length greater than three has a 
chord. A clique tree (or a junction tree) T of a chordal graph Q v is a tree, such that the 
vertices of T are cliques of Q v and it satisfies the following property: 

D s n D t C D u for all D u on the path between D s and D t in T. 

An intersection S of neighboring cliques in a clique tree is called a minimal vertex sepa- 
rator. In the following S denotes the set of minimal vertex separators of a chordal graph. 
When Q' is not connected, we regard the empty set as a minimal vertex separator of 
G v . 

For a clique D G T> of a decomposable model, let Simp(D) denote the set of sim- 
plicial vertices in D and let Sep(D) denote the set of non-simplicial vertices in D 
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(Hara and Takemura (2006)). If Simp(£>) ^ 0, D is called a simplicial clique. A sim- 
plicial clique D is called a boundary clique if there exists another clique D' G X> such that 
Scp(D) = DnD' (Shibata (1988)). Simplicial vertices in boundary cliques are called sim- 
ply separated vertices (Hara and Takemura (2006)). Hara and Takemura (2006) showed 
that a clique D is a boundary clique if and only if there exists a clique tree such that D 
is its endpoint. Hence there exist at least two boundary cliques in any chordal graph. 

Finally we summarize some relevant facts on fibers and Markov bases (Takemura and Aoki 
(2004, 2005)). Given the generating class V = {Di, . . . ,D r } of a hierarchical model, we 
denote the set of marginal frequencies as 

b = {n Dj (i Dj ), i D] e X Dj , j = 1, . . . , r}. 

We consider b as a column vector with dimension d = Ylj—i Yis^D Is> wnere the elements 
are ordered according to an appropriate lexicographical order. We also order the elements 
of n appropriately and consider n as a column vector. Then the relation between the 
joint frequencies n and the marginal frequencies b is written simply as 

b = An, 

where A is a d x \X\ matrix consisting of 0's and l's. A is the "incidence matrix" of cells 
and marginals with 1 indicating that the corresponding cell (column) is included in the 
corresponding marginal (row). 
Given b, the set 

= {n > | b = An} 

of contingency tables sharing the same marginal frequencies, b, is called a fiber or h-fiber, 
where n > denotes n(i) > for all i € I. All contingency tables n in the same fiber J 7 ^ 
have the same total frequency n = X^ez^W- We call this common total frequency the 
sample size or the degree of b and denote it by degb. We call J-b with degb = 2 a "degree 
two fiber" in the following. 

An integer array z = {z(i)}i e x of the same dimension as n is called a move if Az = 
0, that is, zr> (in) '■= J2i c ei c z (iD,iD c ) = for all D 6 V. A move z is written as 
the difference of its positive part and negative part as z = z + — z . Then Az + = Az~ . 
Therefore z + and z belong to the same fiber. In this case we simply say that a move 
z belongs to the fiber J-~Az.+ ■ We call degAz + the degree of a move z. Clearly degAz + > 
2. Especially when z is a primitive move, that is, a square-free move of degree two, 
dcgAz + = 2 and z + and z belong to the same degree two fiber. Therefore the structure 
of primitive move is equivalent to the structure of corresponding degree two fiber. If we 
add a move or subtract a move ztong !F\ 3l we can move to another state n + z (or n — z) 
in the same fiber T\, as long as there is no negative element in n + z (or n — z). A finite 
set M. of moves is called a Markov basis if for every fiber the states become mutually 
accessible by the moves from M. . By using the Metropolis-Hastings procedure to control 
transition probabilities by moves of a Markov basis, we can construct a Markov chain on 
every fiber (Diaconis and Sturmfels (1998)). 
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A Markov basis M is minimal if every proper subset of M is no longer a Markov 
basis. Minimal Markov bases may not be unique in general. However, in view of the 
definition of the minimum fiber Markov basis (the set of moves that cannot be replaced 
by a sequence of moves of lower degree, see Section 2.2 of Takemura and Aoki (2005)), 
the fibers of the moves of all minimal Markov bases are common. We refer to the set of 
fibers common to all minimal Markov bases as the fibers of the minimum fiber Markov 
basis. 

Suppose that a degree two fiber J-"b contains more than one element, that is, |.Fb| > 2. 
Then no two elements n, n' of the fiber share a support: 

degb = 2, n^n'e^b ==> supp(n) n supp(n') = 0. 

It follows that each element of a degree two fiber with more than one element is an 
indispensable monomial (Aoki et al. (2008)), that is, each contingency table of sample 
size two is isolated and has to be connected to some other table in the same fiber by 
a degree two move of a Markov basis. Hence each degree two fiber with more than one 
element has to be a fiber of the minimum fiber Markov basis. This fact holds for any 
hierarchical model. Note however that for some hierarchical models, such as no three- 
factor interaction models (Aoki and Takemura (2003)), every degree two fiber has only 
one element. 

On the other hand, for decomposable models, Dobra (2003) has shown that there exists 
a Markov basis consisting of primitive moves. It implies that for decomposable models it 
suffices to study degree two fibers. In particular the fibers of the minimum fiber Markov 
bases are exactly the degree two fibers with more than one element. Furthermore, by 
the characterization of the uniqueness of minimal Markov bases in Takemura and Aoki 
(2004), it follows that the minimal Markov basis for a decomposable model is unique if 
and only if all degree two fibers contain at most two elements. Based on this result we 
will give a necessary and sufficient condition for the uniqueness of minimal Markov bases 
for decomposable models (Theorem 2 below) in terms of the properties of their chordal 
graphs. 

3. Structure of degree two fibers 

In this section we study the structure of degree two fibers to clarify the structure of 
primitive moves. Let T> = {Di, . . . , D r } be the generating class of a hierarchical model. 
Let b be a set of marginal frequencies of a contingency table with sample size two. 
We are interested in the structure of a degree two fiber J-\>. Because the sample size is 
two, for each D GT>, there exist at most two marginal cells in with positive marginal 
frequency hd^d) > 0. The same reasoning holds for each variable 5 £ A; namely in the 
one-dimensional marginal table {n^s}(is), is £ {0, 1, . . . ,Is — 1}}, there exist at most two 
levels is such that n^gy(ig) > 0. For a given b we say that the variable 5 is degenerate if 
there exists a unique level is such that n{gy(i$) = 2. Otherwise, if there exist two levels 
is ^ i' s such that n{<s}(i<s) = nrs}(i's) = L then we say that the variable 6 is non-degenerate. 
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If a variable 6 is degenerate for a given marginal b, then the level of the variable 8 
is uniquely determined from b and it is common for all contingency tables n G Ty,. In 
particular, if all the variables 5 € A are degenerate, then F\> = {n} is a one-element fiber 
with frequency n(i) = 2 at a particular cell i. Since this case is trivial, below we consider 
the case wherein at least one variable is non-degenerate. For convenience we denote 

n = (0(j) 

when n(i) = n(j) = 1, i ^ j . From the fact that there exist at most two levels with 
positive one-dimensional marginals for each variable, it follows that we only need to 
consider 2 x ■ • • x 2 tables for studying degree two fibers. Therefore, for our purposes in 
this section, we let 1\ = ■ ■ ■ = I m = 2,1= {0, l}" 1 without loss of generality. 

For a given b of degree two let Ab denote the set of non-degenerate variables. 
As noted above we assume that Ab ^ 0. Each n G T\, is of the form n = = 
(ii, . . . , i m )(i' 1 , . . . , i' m ), i 7^ i' ■ Furthermore, for non-degenerate 8 G Ab the levels of the 
variable S in i and i 1 arc different: 

= {0,1} V<5 G A b , 

or equivalently i'$ = 1 — i$, V<5 G Ab- In the following we use the notation ig = 1 — ig. 
More generally for a subset D = {5i, . . . ,5k} of the variables and a marginal cell Id = 
(i^,. . . ,ij k ) we write 

i*D = {i*8i>- = (l-i*i 5 ---)l-*« fc )- 

Let us identify n G J-\> with the set {i,i'} of its two cells of frequency one. Then we see 
that the number of elements |J-"b| of the fiber is at most 2^ Ab ' _1 . However some choice of 
{i, i'} with 

i 4 ,»;6{0,l} V(5GA b , 

may not be in the fiber J-b. This is because if 6 and 6' belong to a common D G X>, then 
the values of is and is> are tied together. For example, let D = {1,2} GT> and consider 
the {1, 2} -marginal specified as 

n { i, 2 }(0,0) = n {lja} (l,l) = 1, n {1 , a} (0,l) = n {1 , 2} (l,0) = 0. 

Then if we choose i\ = 0, we have to choose 12 = 0. In Takcmura and Hara (2007) we 
considered a very similar problem in the framework of swapping observations among 
two records in a microdata set for the purpose of statistical disclosure control. As in 
Takcmura and Hara (2007) we make the following definition. 

Let <5(Ab) be a graph with the set of vertices Ab and an edge between 5 G Ab and 
8' G Ab if and only if there exists some D GT> such that 6,6' G D. Namely there exists 
an edge between two non-degenerate variables if and only if these two variables appear 
together in some marginal tables of T>. Note that £?(Ab) is the induced subgraph of Q v 
with the vertices restricted to Ab . As discussed above in this case the values of is and if 
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are tied together and once the value of is is chosen, for example, ig = 0, then the value 
of is* becomes fixed depending on the specifications of the marginals jijj. 
We summarize the above argument in the following lemma. 

Lemma 1. Suppose that b is a set of consistent marginal frequencies of a contingency 
table with sample size two. Let T be any subset of a connected component in £?(At>). Then 
the marginal table nr = {nr(ir) \ ir G2Tr} is uniquely determined. 

Proof. Let r(T) be the number of generating sets D ET> satisfying lnD^0. We prove 
this lemma by induction on r(T). When r(T) = 1, the lemma obviously holds. Suppose 
that the lemma holds for all r(T) < r' and we now assume that r(T) = r' . Let li cl and 
T 2 C T satisfy 

r!ur 2 = r, r 1 nr 2 ^0 7 r(r 1 )</, r(r 2 )<r'. 

Since r(ri) < r' and r(T2) < r' both tir 1 and nr 2 are uniquely determined. Suppose that 

^r 1 (ir 1 \r 2 ,«r 1 nr 2 ) = l, ^(^vr^^nrs) = 1 - (3-1) 
Then from the consistency of b there uniquely exists «r->\ri £^T 2 \ri such that 

«r 2 (ir 2 \r 1 ,«r 1 nr 2 ) = l, n T2 (i r2 \ ri ,i r ^nr 2 ) = 1 - (3-2) 
Hence the table nr = {"(jr) | jr & lr} such that 

n( ^\ = / 1. if Jr = (ir^r^i^nr^ir^) or jr = (i rAr2 ,if inr ,,i rAri ), 
urJ \0, otherwise 

is consistent with the marginal b. 

Suppose that there exists another marginal table n r that is consistent with b such 
that nr(jr) ="r(jf) = 1 an d jr ^ (*ri\r 2 7*rinr 2 ,*r 2 \ri)- Then we have at least 

nr 1 (ir 1 ) = or nr 2 («r 2 )=0. 

This contradicts (3.1) and (3.2). □ 

By using the result of Lemma 1, we obtain the following theorem on the number of 
elements in degree two fibers. 

Theorem 1. Let J-y, be a degree two fiber such that Ab ^ and let c(b) be the number 
of connected components o/£/(Ab). Then 
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Proof. Denote by T\, . . . ,T C , c = c(b), the connected components of (J(A b ). Define r c+ i 
by r c+ i = A \ A b . Then there exists ir c+ i such that 

*r c+1 = {is | 8 £ T c+1 ,ri{s}(is) = 2}. 

From Lemma 1 the marginal cells ir k such that nr k (ir k ) = nr k («p ) = 1 uniquely exist 
for k = 1 , . . . , c. Now define lb by 

lb = {irnirj x {*r 2 , «f 2 } x ••• x {ir c ,«r c } x {«r c+1 }, 

where x denotes the direct product of sets. Suppose that j £ lb- Define rij = {n,j(i) \ i £ 
1} by 

n .( i ) = / 1 ' if* = j'ori = j*, 
3 v ' [0, otherwise. 

Then we have J"(J b ) = {nj | j £ T b } C J" b and | J"(Z b )| = 2 C_1 . 

If there exists n' = {n'(i) \ i £ X} such that n' £ F\y and n' ^ ^(It,), then there exists 
a cell j £ I and 1 < k < c + 1 such that n(j) = 1 and jr k ^ «r fc • This implies that there 
exists Di £T> such that n'{iDi) ^ n(iD,)- Hence we have \J-~b\ = 2 C ^~ 1 . □ 

As mentioned in Section 2, for a consistent b such that degb > 2, it is known that 
Fb may be empty (e.g., Irving and Jerrum (1994); Vlach (1986)) in general. However 
Theorem 1 shows that, in the case degb = 2, if a consistent b such that A b ^ is given, 
then Tb 7^ for any hierarchical model. 

It is helpful to consider permuting the levels 0f>l for each variable and under- 
stand Theorem 1 in a canonical form. This amounts to considering invariancc of hi- 
erarchical models with respect to permutation of levels of each variable as studied in 
Aoki and Takcmura (2008a). Although we have reduced our consideration to 2 m tables 
in treating degree two fibers, we are really considering general hierarchical models of 
I\ x • • • x J m tables. Note that hierarchical models possess the symmetry with respect 
to relabeling the levels of each variable, that is, it is invariant under the action of the 
direct product of symmetric groups Si 1 x • • • x Si m acting on the set of cells. If we again 
restrict our consideration to degree two fibers, we only need to consider the action of 
S™ = S2 x • • • x S , 2 . It is clear that structures of degree two fibers are invariant under the 
action of S™. 

In particular as a "representative fiber", we can consider b such that the levels of 
all degenerate variables are determined as 0. Also for such a b, let T C A b be the set 
of vertices of a connected component of (7(A b ). Then we can without loss of generality 
assume that two r-marginal cells of frequency 1 are specified as 

l = n r (0,0,...,0) = n r (l,l,...,l). (3.3) 

This can be achieved by interchanging the levels of each variable in A b . Under this stan- 
dardization the proof of Theorem 1 is easier to understand, because for each connected 
component of <?(A b ) we either choose all 0's or all l's for the component. 



216 



H. Hara, S. Aoki and A. Takemura 



This standardization is also useful in determining the setwise stabilizer of J-t, in S™ 
(Section 3.1 of Aoki and Takemura (2008b)). If we standardize the levels as (3.3), then 
the setwise stabilizer of T\, is isomorphic to c(b)-fold direct product of 5*2 's: 



In other words the structure of J-b is equivalent to the structure of the fiber Tv with 
A = Ab' = {1, . . • ,c(b')} and totally disconnected (J(Ab')- In the next section we use this 
fact in determining the minimal invariant Markov bases for decomposable models. 

Finally we prove the following theorem on a sufficient condition for non-uniqueness of 
minimal Markov bases. 

Theorem 2. Let T> = {D\, . . . , D r } be the generating class of a hierarchical model. Sup- 
pose that m > 3 and there exist three variables 61,62,63 that are not connected to each 
other in Q v . Then minimal Markov bases for the hierarchical model with the generating 
class T> are not unique. 

Proof. It suffices to find a degree two fiber with more than two elements. Consider b such 
that Ab = {61,62,63}. From the condition of the theorem C?(Ab) has an induced subgraph 
with three connected components. Therefore |J-b| =4. This completes the proof. □ 

4. Markov bases for decomposable models 
4.1. Minimal and unique minimal Markov bases 

In this section we discuss Markov bases of decomposable models in detail from the view- 
point of minimality based on the results obtained in the previous section. Since there 
exists a Markov basis consisting of primitive moves for decomposable models, the set of 
fibers of the minimum fiber Markov basis coincides with the set of degree two fibers with 
more than one element. Theorem 1 of the previous section enables a complete description 
of minimal Markov bases of decomposable models. 

Let degb = 2. Let 7b be any tree whose nodes are elements of J-b- Denote the set of 
edges in 7b by Mr b - We note that we can identify each edge (n, n') € M.j- b with a move 
z = n — n'. So we identity A4j- b with a set of moves for J-b- In considering Markov bases, 
we ignore the sign of z and identify z = n n' with z = n' n and consider the edges 
in 7b as undirected. In contrast when we consider Grobner bases, we distinguish z from 
— z and correspondingly consider directed edges. 



5 2 c(b) = S 2 x • • • 



xS 2 . 



Let £>nd be 



B nd = {b|degb 



2,|-7b|>2}. 



(4.1) 



Then we define M° as follows, 




(4.2) 



heB, 
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By following Dobra (2003) and Takemura and Aoki (2004), we easily obtain the fol- 
lowing theorem. 

Theorem 3. A4° is a minimal Markov basis and (4-2) is a disjoint union. Conversely 
every minimal Markov basis can be written as in (4-2). 

Example 1 (The complete independence model of three-way contingency ta- 
bles). Consider the model V = {{1}, {2}, {3}} for the 2x2x2 contingency tables. £>„d 
for the model has seven elements. Denote them by bi, . . . , by. Figure 1 shows an example 
of M.Tb t f° r £ = 1, - - . , 7. bi, . . . ,b7 satisfy 

A bl ={1,2,3}, A b2 =A b3 ={l,2}, 

(4-3) 

A b4 =A b5 ={2,3}, A b6 =A b7 ={l,3}. 

The union of all these moves is a minimal Markov basis for the model. Since J-" bl is a 
four elements fiber, Tb 1 is not uniquely determined. Hence minimal Markov bases are 
not unique for this model. 

As seen from this example, minimal Markov bases are not necessarily uniquely deter- 
mined. Based on Theorems 1 and 3, we can derive a necessary and sufficient condition 
on decomposable models to have the unique minimal Markov basis. 

Corollary 1. There exists the unique minimal Markov basis for a decomposable model 
if and only if the number of connected components in any induced subgraphs of Q v is less 
than three. 

Proof. Suppose that £?(A b ) has more than two connected components. Then since 
|-7"b| _ 4 from Theorem 1, 7b is not uniquely determined. For a different tree 7^, 
M.Tb 7^ M.f. Hence minimal Markov bases are not unique either. 

Conversely, assume that the number of connected components of (?(A b ) for all b G £>„d 
is two. Then Tb for all b £ 23 n d is uniquely determined. Hence the minimal Markov basis 
is unique. □ 

For decomposable models Q 1 ^ is chordal. From the graph theoretical viewpoint the 
above corollary can be rewritten as follows. 

Corollary 2. For a decomposable model, there exists the unique minimal Markov basis 
if and only if Q v has only two boundary cliques D and D' such that D" C D U D 1 for all 
D"eV. 

Proof. Suppose that Q v has two boundary cliques D and D' such that D" cDUD' 
for all D" £T>. Then any vertex in D" is adjacent to D or D' . Hence the number of 
connected components for any induced subgraph of Q 1 ^ is at most two. 
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( (()()())(iii) )-( (noi)(iii)) } 

( (011)(100) ) ( (010)(101) ) 



((000)(110) ) - ( (010)(100)) ((0Q1)(111) )-( (011)(101)) 



Mr, Mr. 



( (000)(011) )- ( (010)(001) ) ((100)(111) ) - ( (110)(101) ) 



Mr, Mt. 
be o. 



( (000)(101) )-( (001)(100) ) ((010)(111) )-( (0U)(U0) ) 
(The triplets in brackets refer to cells in the contingency table.) 

Figure 1. Mr bt in the complete independence model of three-way contingency tables. 




Conversely suppose that there exists D" € T> such that D" <f_ D U D' . Then the sub- 
graph induced by the union of D" \ (DU D'), Simp(D) and Simp(D') has three connected 
components. □ 



The graphs with r = 2 always satisfy the conditions of the theorem. For r > 3 the 
graph with 

V = {{l,...,r-l},{2,...,r},...,{r,...,2r-2}} (4.4) 

satisfies the conditions of the theorem. Figure 2 represents the graphs satisfying (4.4) for 
r = 3,4. We can easily see that any induced subgraph of the graphs in the figure has at 
most two connected components. 
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Let T= (D,£) be a clique tree for Q v . Denote by T e = (T> e ,£ e ) and 7^ = (T>' e ,£' e ) the 
two induced subtrees of T obtained by removing an edge e £ £. Define V e and V e ' by 

7 e = |J D, K'= |J D. 

Let A / l r (V r e , V e ') be the set of all primitive moves for the decomposable model determined 
by the chordal graph whose set of cliques is {V e ,V^}. Dobra (2003) showed that 

M T = \jM T (V e X) (4-5) 

ee£ 

is a Markov basis. We call M. T a Dobra's Markov basis. From the viewpoint of minimality 
of Markov bases, we have the following theorem. 

Theorem 4. A decomposable model has a clique tree T such that MC~ is a minimal 
Markov basis if and only if the model has the unique minimal Markov basis. 

Proof. When a decomposable model has unique minimal Markov basis, Ai' coincides 
with the minimal Markov basis. 

Suppose that there exist three vertices in (j that are not adjacent to each other. 
Let 1,2 and 3 be such three vertices and assume that I € Di, D\ ET>, for I = 1,2,3. 
Define {1, 2, 3} c = A \ {1, 2, 3}. Consider a degree two fiber T\y such that Ab = {1, 2, 3} 
and ^{i.2,3} c (*{i,2.3}°) = 2 for some i{i.2,3}°- Then \T\,\ ~ 4 from Theorem 1 and we can 
denote these four elements by 

ni = (000i{i )2 ,3}'=)( 111 Hi,2,3}0i 

n 2 = (00H {1)2>3} o)(110i {li2i 3 } c), (4.6) 

n 3 = (010^i i2 ,3}<=)( 101 Hi,2,3}=)> 
n 4 = (01H {lj2i3} =)(100i { i )2i3} c). 

A minimal Markov basis connects these four elements by three moves. Let T= (?),£) be 
any clique tree for Q" and T' = (V,£') be the smallest subtree of T satisfying Di £ V 
for 1 = 1,2 and 3. Then we can assume that T' satisfies either of the following two 
conditions, 

(i) £>2 is an interior point and D\ and D% are endpoints on the path; 

(ii) all of D\, D2 and D3 are endpoints of T 7 . 

In both cases there exists e £ £ such that Di,D 2 C V e and D 3 C V' e . Then M T (V e ,V^) 
includes the following two moves, 



zi=ni-n 2 , Z2 = n 3 -n 4 . 
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Figure 3. T in Example 2. 



On the other hand, there also exists e' 6 £ such that D\ C V e > and D2,Z?3 C V e > . In 
this case (y^ ,V' e ,) includes the following two moves, 

z 3 =ni-n 4 , z 4 = n 2 -n 3 . 

Thus includes at least four moves for the fiber J\,, which implies that JvC is not 
minimal for the model that does not have the unique minimal Markov basis. □ 

Example 2 (The complete independence model of four-way contingency ta- 
bles). Consider the 2x2x2x2 complete independence model V = {{1}, {2}, {3}, {4}}. 
Let Jb be the fiber with A b = {1,2,3,4}, that is, c(b) = 4 and |J" b | = 8. Consider M T 
for T in Figure 3. Denote the set of moves for Tb belonging to M 1 * by M.^. Figure 4 
shows .M^ • As seen from Figure 4, A4^ includes 12 moves. Since |J-b| = 8, 7 moves are 
sufficient to connect T\>. 



4.2. Minimal invariant Markov bases 

In this section we consider Markov bases from the viewpoint of invariance under the 
action of the product of symmetric groups G = Gi lt ...j m = x • • • x Si m on the levels 
of the variables. The organization of this section is as follows. We first express a minimal 
invariant Markov basis as a union of orbits of Gi 1 _...j m , which minimally connects repre- 
sentative fibers (see (4.7) below). Then we show that the minimal set of orbits connecting 
a non-degenerate fiber is in one-to-one correspondence to a basis of a vector space over 
the finite field GF(2) (Lemma 2 and Theorem 5 below). Then the structure of minimal 
invariant Markov bases is given in Theorem 6. 

According to Aoki and Takemura (2008a), a set of moves M is called G-invariant if 

geG, zeM => g(z)eM or -g(z,)eM. 

A4 is called a G-invariant Markov basis for T> if it is a Markov basis and also G-invariant. 
A G-invariant Markov basis M is minimal if no proper G-invariant subset of A 7 ! is a 
Markov basis. 

As discussed at the end of Section 3, by appropriate reordering of the indices we can 
consider a representative fiber 

|A\A b | 

^9njE(0-0)(l-lCo). 
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Then any n 6 is expressed as follows, 



|Ti | |A\A b | |ri| |A\A b | 

n = (0^0zr 2 • ■ • zr c(b) 0^0)(T^Tzf 2 • ■ -i* r<b) 0^0), 
|r,| |r ; 
jr,=0---0 or ir, =1---1, I = 2, ...,c(b), 



where L; are the connected components of Q(Ab). Let G Vl ,1 = 2,..., c(b), be the diagonal 

G r >={g = (g,...,g)\geS 2 }ciSl r ' 1 . 



IP 

subgroup of 5*2 defined by 



Define 

G b = G F2 x • • ■ x G r -i b > 
and let g G Gb act on n G J 7 ^ by 

|ri| |A\A b | |A\A b | 

g(n) = (0^0 g 2 (i Ta ) • • • &(b) (ir etb , ) 0^0 ) (1^1 £ 2 (*r 2 ) • ' • ffc(b) (ij a(b) ) 0^0 ) . 

Clearly g(n) G J 7 ^ for neJ^ and furthermore for any n6 Jjj there exists g G Gb such 

that n = g(n b ). This shows that Gb C Gi 1 i m is the setwise stabilizer of J-Q acting 

transitively on J 7 ^. 

Let Mc b be a minimal Gb-invariant set of moves that connects J 7 ^. Let re(b) be the 
number of Gb-orbits included in Mc b - As representative moves of Gb-orbits in A4a b we 
can consider 

z h k =n*-n h k eM h , n£ej£, k = 1, . . . , K (b). 
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This is because we can always send n in z = n - n' to nff by the transitivity of Gb. 
Denote = {z b , . . . , z^/ b \}. Define the set of representative fibers by 

B° nd = {b\n*e^}cB nd . 

From Aoki and Takemura (2008a) a minimal Gj^... ^-invariant Markov basis can be 
expressed by 

K(b) 

M G = (J U G ^.. .,i m (*k), ( 4 -7) 
beZ3° d fc=i 

where Gi lr ..j m (z^) denotes the i m -or bit through z b . Hence in order to clarify the 

structure of A4g, it suffices to consider 2 x • • • x 2 tables and investigate «(b) and M.% 
for each JQ. 

As mentioned in Section 3, the structure of J-^ is equivalent to the structure of the 
fiber with Ab = A = {1, . . . , c(b)} and Q(A\,) is totally disconnected. We first consider 
the structure of such a fiber. satisfies 

•?b = {( *2---j c (b))(l«2---i*(b)) I («2"-i c (b))=*A\{l} e^A\{i}} (4.8) 

and (0 ■ • -0)(1 - -- l) G J%. Then we note that we can identify G b with S^ -1 . For g G 
S% 1 , wc write g = (52, ■ • ■ ,9c(b))i where gi G S2 for 1 = 2,..., c(b). A representative 
move of S2 1 -orbit is expressed by 

z b = (0---0)(l-.-l)-(0 iAUl} )(l l l Ul} ) 

for some i A \{i} g2a\{i}- Wc first derive «(b) and M Gb . Let V^' 1 = {0, l}^" 1 
denote the (c(b) — l)-dimensional vector space over the finite field GF(2), where the 
addition of two vectors is defined as the "exclusive or" (XOR) of the elements. Let © 
denote the XOR operation. Let o denote the group operation of S^ 1 ^ -1 . Then we obtain 
the following lemma. 

Lemma 2. 1 * s isomorphic to y c ( b ) _1 . 

Proof. Consider the map <j): S^^' 1 — > V c ^ b - )_1 such that <p(g) = v = (i>2, ■ ■ ■ ,f c (b)) 6 
yc(b)-l ; where 

_ (0, if gi{i{) = %i, 
Ui \l, if Si (*i)=*j 

for / = 2, . . . ,c(b) and = {0, 1}. For g' = (g' 2 ,.. .,g' c(b) ) G S^ -1 , ^ G S 2 , and v' G 

V c(b)-i ; define 0^/) =v ' = (v' 2 ,...,v', b) ). Then we have 4>(g o 5 ') = v = (u 2 , . . . ,« c ( b )), 



'*£ 9i°9i(it) = ih 
1, X 9i ° 9i(ii) = 
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veV*' -1 , where 

for 1 = 2,..., c(b). Hence we have 

Ui=u;©<U(, Z = 2,...,c(b). 

Therefore is a homomorphism. It is obvious that is a bijection. Therefore 1 is 

isomorphic to VWO -1 . □ 

Based on this lemma, we can show the equivalence between 1 -orbits in a minimal 

^,c(b) 1 _ mvar j an ^ se ^ £ moves that connects F b and a (vector space) basis of y c ( b ) _1 . 

Theorem 5. Let V° = {vfe = (v k 2, • • • , v kc ^y), = 2, . . ., c(b)} oe any 6aszs o/V c ^ b - )_1 . 
De/?ne n b , n b fc eJg 6y 

n b = (00- • - 0)(11 • • • 1), < = (0v k2 ■ ■ ■ v kc{b) )(lv* k2 ■ • -< c(b) ), 

where v kl = l®Vki- Let M.Q b be an 1 -invariant set of moves in T b . Then M.G b is a 

minimal S 2 ^ 1 -invariant set of moves that connects if and only if the representative 

moves of the S% 1 -orbits in Mc b are expressed by z b fc = n b — n b fc , k = 2,...,c(b). 
Hence /s(b) = c(b) — 1. 

Proof. Suppose that M.G b is a minimal £2 1 -invariant set of moves that connects 
J- b and that M.c b includes ft(b) orbits as S% 1 (z b ),..., 1 ( Z K(b))' wriere 

Z fe = n - n fc: n fc = ( *fc2 • ■ ■4c(b))(life 2 ' • ■ *L(b)) 

for ifciGl;, fc = l,...,«(b), / = 2,...,c(b). Let j t eSf )_1 satisfy .g fc ( n o) = n fe for 
fc = l,...,«(b). We write g k = (g k2 , . . . ,g kc (b)), 9kl G <52 for Z = 2,...,c(b). Let i? b = 
{g 1 , . . . , g K ^} C §2 be a subset of . As mentioned above, T b can be ex- 

pressed as in (4.8). Hence for any n € J 7 ^ there exists g G S^ b ^ 1 satisfying n = g(n b ). 
A^Gb connects F b if and only if there exists p < /c(b) such that 

n = n b - z b - ^ (z b ) o • ■ • o «^ (z b p ) 

and g = g kp o • • • o Hence A^G b is a minimal S^^ ^-invariant set of moves that 
connects LF b if and only if H b satisfies 

V ff 6 S 2 c(b) ~\ 3p < K (b), 3/ 1 6i/ b ,..., 3 5 fe - G H b s.t. 5 = o ■ • ■ o g k > (4.9) 
and no proper subset of H b satisfies (4.9). 
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Denote V° = (f>{H h ) C V c(b)_1 . Then from Lemma 2, (4.9) is equivalent to 

Vve V,3v 2 e V°,...,3v p+ i G V° s.t. v = v 2 e---©v p+ i. (4.10) 

From the minimality of M.G b no proper subset of V° satisfies (4.10). This implies that 
V° is a basis of V c ( b ) _1 and hence /-t(b) = c(b) — 1. If we define g k ~ (j)~ 1 (vk+i) for 
k = 1, . . . ,c(b) — 1, we have <to(0) = v k +i,i and hence g fe (n b ) = n b = n b fc+i . Therefore 

z b fc , k = 2, . . . , c(b), are the representative moves of the .S'^ 1 ''' 1 -orbits in M.c b ■ 

Conversely suppose that the representative moves of Aia b are z b fc , k = 2, . . . , c(b). 
V° satisfies (4.10) and no proper subset of V° satisfies (4.10). Hence if we define g k = 
-1 ( v fc+i) and H h = {g 1 ,...^^- 1 }, H h satisfies (4.9) and no proper subset of i/ b 
satisfies (4.9). Hence M.a b is a minimal 1 -invariant set of moves that connects 

Jb. □ 

For example, we can set V° = {v2, . . . , v c( - b )} as 

V 2 = (ll-.-ll), V 3 = (01"-ll), 

v c(b) _ 1 = (00---011), v c(b) = (00---01), 
and then the representative moves in a minimal G-invariant Markov basis are 
z° 2 = (00-"0)(ll-"l)-(011-"ll)(100---00), 

zS = (oo---o)(ii---i) - (ooi---ii)(no---oo), 

(4.11) 

z °(b) = (00---0)(ll---l) - (000---01)(lll---10). 

So far we have focused on such that A b = A = {1, . . . , c(b)} and <7(A b ) is totally 
disconnected. Now we consider a fiber for a general b of a general decomposable model. 
Define g kl £ G r ' by 

\r t \ 

n frT^ - / " ' °> ifwfc+i=0, , . 

• 9h(0 "- 0) -\i-..i, if« fc+1 = i (412) 

for fe = 1, . . . , c(b) — 1 and Z = 2, . . . , c(b) and define g fc € G b by 

|ri| |A\A b | 
/(n) = (0---0ff fe 2(ir a )---fljfec(b)(»r e(b) ) 0---0) 
|ri| |A\A b | 
x (l---l5A ; 2(ir 2 )---5Mb)( i r c(b) ) 0---0). 



(4.13) 



Denote n b = o fe (n b ) and z b = n b — n b . Based on (4.7) and Theorem 5, we can 

Vfc+l 3 \ U/ Vfc+i U Vfc+i V I ! 

easily obtain the following result. 
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f Mi 



((000)(111) 

((Qii)(ioo) 



(001)(110) 



(010)(101) 



(g((000)(111)])-(G((001)(110» 
(G((011)(100)]P(g((010)(101))' 



Mg, 



( (000)(110) 



(010) (100) 




G(Mg. 



(G((000)(110)])-(G((010)(100))) 



(G((000)(011)])-(G((010)(001)) 



G(Mg. J) 



(G((000)(10ljj)-(G((001)(100)^ 



Figure 5. The structure of minimal G2,2,2-invariant Markov bases for the complete indepen- 
dence model of three-way contingency tables. 



Theorem 6. A4c b is a minimal 1 -invariant set of moves that connects if 

and only if the representative moves of the 1 -orbits in A4c b o^e expressed as z^, 

k = 2, . . . , c(b) . Hence re(b) = c(b) - 1 . Then 

c(b) 

M G = (J (J G Iu ... Jm (4) 

beB° d k=2 

is a minimal Gi^ j m -invariant Markov basis. Conversely, every minimal Gi u j m - 
invariant Markov basis can be written in this form. 

Example 3 (The complete independence model of three-way contingency ta- 
bles). Define b t as in Figure 1 of Example 1. Then B® d = {bi, b2, b4, be}. Figure 5 
shows a structure of M.q for the complete independence model of 2 x 2 x 2 contingency 
tables. The left half of the figure shows the structure of Mg^ , b t 6 B® d . 
c(bi) = 3 and hence «(bi) = 2. If we set vj 5 = (10) and v£ = (01), we have 



z^ 1 = (000)(111) - (010)(101), z* 1 = (000)(111) - (001)(110). 
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Figure 6. The clique tree with two endpoints. 



The orbits Sf (z^ 1 ) and 5*1 (z!? 1 ) are expressed in dotted lines and solid lines, respectively, 
in the figure. 

c(b t ) = 2 and ft(b t ) = 1 for t — 2,4,6. There exists one orbit in Aic b for t = 2,4,6. 
Then from Theorem 6 a minimal G2.2,2-invariant Markov basis is expressed by 

M.G = G(z^ ) U G(z 2 31 ) U G{z\ 2 ) U G(z^ ) U G{z\ 6 ) . 

Next we consider a Dobra's Markov basis MT from the viewpoint of invariance. Since 
does not depend on the levels of the variables, is G^....^ -invariant. Based on 
the result of Theorem 5, we can show that JvC is not always a minimal invariant Markov 
basis. 

Theorem 7. A4^~ is minimal invariant if and only if T has only two endpoints. 

Proof. It suffices to show that the theorem holds for 2 x • • • x 2 tables. Suppose that 
T= (£>, £) has more than two endpoints. Let Di, D 2 and D3 be three of them. Then 
they are boundary cliques. Suppose 1,2,3 G A are simply separated vertices in D±, D 2 
and D3, respectively In the same way as the argument in the proof of Theorem 4, there 
exist e, e', e" G £ such that 

D±,D 2 G v e , Dsev;', 

D 2 ,D 3 eV e ,, D\ G V' d , 
£) 3 ,-DiGF e », D 2 eVi„. 

Consider the moves for the fiber for b such that Ab = {1,2,3}. Define Z5 and Z6 by 

Z5=ni-n 3 , z 6 = n 2 -n 4 , 

where Hi, . . . ,114 are defined in (4.6). Then we have 

z 1 ,z 2 eM T {V e X), v 3 ,Zi£M T {V e ,X>), z 5 ,z 6 eM T (V e „X><)- 

We note that {zi,z 2 }, {z3,Z4} and {z5,Zg} are S'f-orbits in A4^. Since n(h) = 2, M 1 ' 
is not minimal invariant. 

Suppose that T has only two endpoints. Then T is expressed as in Figure 6. Let 
ri(b), . . . , r c (b)(b) be the c(b) connected components of C/(Ab). Suppose that 5\ G L;(b). 
The structure of i s equivalent to the structure of J^, such that Ab' = {Si,. ■ ■ , <5 c (t>)-i} 
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and £?(Ab') is totally disconnected. So we restrict our consideration to such a fiber. 
Denote by jQ, the representative fiber for b'. Let 

Mw = {n - n' | n, n' e jg, , n ^ n'} 

denote the set of all moves in JQ,. Without loss of generality we can assume that Si G 
D n(l) , where tt(1) < • ■ • < 7r(c(b')). Define e t = (A-i, A) G 5, 5j = A-iHA, VJ = \# 
and V{ = V e ' \ Si for / = 2, . . . , c(b'). Then the moves in M r (Vi,V l ') are expressed by 

z = (ivi,iv{,isi) Uvi , 3v{ ,isi)-(ivi,3v{,isi) (jv, ,iv t ',is t ), 
W x , jvi 6 2V, , , Jv/ G ^y/ , *s, e X Si . 

If V ei n A b /_= or V;' ; n A b < = 0, then we have M r (V ei , V e ' ) C\Mw = 0. If V e , n A b < ^ 
and VI fl A b ' ^ 0, then there exists 2 < &(ej) < c(b') satisfying Sk G V; for all fc < fe(e;) 
and S k G V/ for all ft > k(e{). Then 

^ r (v e „K,)nA< b , = s 2 c(b) - 1 ( ai » (ei) ) ) 

where z°( e; ) is defined as in (4.11). Hence we have 

Ml= |J M r (v ei X l )nM h/ = X)sf' ) -\4), 

which includes c(b') — 1 orbits for all b' G <B° d . Hence Al 7 " is minimal Gi lt .. i m - 
invariant. □ 

Example 4 (The complete independence model of four-way contingency ta- 
bles). As an example we consider the 2x2x2x2 complete independence model 
V = {Di = {/}, I = 1, . . . , 4}. Both T 1 and T 2 in Figure 7 are clique trees for D. From 
Theorem 7, MC is a minimal S^-invariant Markov basis. Consider the representa- 
tive fiber such that Ab = {1,2,3}. For j = 1,2, denote the two induced subtrees 
of T J obtained by removing the edge e; by Tl and Tl ■ Figure 8 shows Tl, 72 an d 
M rl (V ei , V ) nMb- If we remove e 3 from T 1 , 1, 2 and 3 arc still connected and hence 
M Tl (V es , F e ' 3 ) n TWb = 0. Therefore Mf 1 includes «(b) = 2 orbits. 

On the other hand, since 7~ 2 has three endpoints, .M 7 " is not a minimal Sf-invariant 
Markov basis. Figure 9 shows T 2 , T 2 ' and M r (V en V^)DM h . Wc can see that M.T in- 
cludes three orbits. As seen from this example, in general the minimality of depends 
on clique trees 7~. 

Example 5. We consider the model defined by the chordal graph in Figure 10. The 
clique tree of this graph is uniquely determined by T 2 in Figure 7. As seen from this 
example, there exist decomposable models such that Ai 1 ' for every clique tree T is not 
minimal G/ lr . .^-invariant. 
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Figure 7. Clique trees for the 4-way complete independence model. 



4.3. The relation between minimal and minimal invariant 
Markov bases 

From a practical point of view a Gi lt „ i m -invariant Markov basis is useful because its 
representative moves give the most concise expression of a Markov basis. On the other 
hand, a minimal Markov basis is also important because the number of moves contained 
in it is minimum among Markov bases. Here we consider the relation between a minimal 
and a minimal Gi u „ i m -invariant Markov basis and give an algorithm to obtain a minimal 
Markov basis from representative moves of a minimal Gi lt ... t i m -invariant Markov basis. 

As mentioned in the previous section, the set of Gb-orbits in a minimal Gb- 
invariant set, Mc b , of moves that connects has one-to-one correspondence to 



^(0000)(1110)^ ^(0010) (1100)^ 



(joiio)(iooo)^ (joioo) (ioioT) 



£>i\_( D 2 1 [ £>3l— ( D 



(joooo)(mo)^ (\mw) (hop) 



^(0110)(1000) ^ ^ (0100) (1010) 



T e i and 7-1' M Tl (V e , ,V^)nM b 

Figure 8. The structure of Mi . 
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and 7^' 



Qoooo)(mo)~) ftooio) (nooA 



(J0110)(1000)^ (^(0100) (1010)^) 



( joooo) (mon ^(ooio) (nooT ) 



s / 
/ \ 

\ 



Qoiio)(iooo)J Qoioo)(ioioA 



^(0000) (1110)^ ^(0010) (1100) 



(joiio) (looo)^ ^(oioo)(ioioy^ 



M T (V ei ,V')nM b 



Figure 9. The structure of M{,' 




Figure 10. A chordal graph whose clique tree is uniquely determined. 
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a basis V° of V c(b)-1 . Define g H <= G Vl and g k e G b as in (4.12) and (4.13). Let 
i?b = {g 1 , ■ ■ ■ , g ^' 1 } C Gb- Now we generate a set of moves .M b in Tb by the fol- 
lowing algorithm. 

Algorithm 1. 

Input: Jb, J ff b = {.9 1 ,...,.9 c(b) - 1 } 
Output: X* 

begin 

Choose any element ni in J~b] 

for k = 2 to c(b) do 

begin 

for / = 1 to 2 fc " 2 do 
begin 

n i+2fe -2 :=ff fc - 1 (n J ); 
z ;+2 fc - 2 : = n i — n (+2 fc - a ; 

>l b ^X b U{z i+2fc - 2 }; 
end 
end 

return -M b ; 

end. 



Theorem 8. _M b generated by Algorithm 1 is a minimal set of moves that connects Tb- 

Proof. Since \M%\ = 2° + 2 1 + • • • + 2 c < b '- 1 = 2 C ^~ 1 - 1, it suffices to show that n ; ^ n r 
for l^V . Suppose that there exist I and I', I y*= I' , such that m = n;/ and n;, ni> are 
expressed by 

n ( =g kp o---og kl (m), m> =g k 'p' o • ■ • o 5 fe i (m), 

where fci < fc 2 < ■ • ■ < k p < c(b) - 1 and k[ < k' 2 < ■ ■ ■ < k' , < c(b) - 1. Without loss of 
generality we can assume p < p' . Then we have 

g k " o • • • o g kl = gK' o • • • o 5 fe 'i (4.14) 

and there exists I <p such that ki ^ k[. From Lemma 2 (4.14) is equivalent to 

v fcl ©•■•evfe =vm e-evy, 

which contradicts that V° is a basis of y°( b ) -1 . Hence we have n; ^ n;/ for l^V . □ 

From (4.2) we obtain the following result. 
Corollary 3. M* = lj bg g .A4 b is a minimal Markov basis. 
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n,i n i 




G h {z 2 ) 

G b (z 3 ) 

Figure 11. Ma b and Ml, generated by Algorithm 1. 



Example 6 ( The complete independence model of a four-way contingency ta- 
ble). Wc consider the same fiber as in Example 2. Define V° = {v2, V3, V4} by V2 = (100), 
V3 = (010) and V4 = (001). Figure 11 shows Aic b an d .M£ generated by Algorithm 1 
with ni = (0000)(1111). 

5. Grobner bases for decomposable models 

So far we have been discussing Markov bases. In this section we briefly discuss Grobner 
bases. For decomposable models, Theorem 4.17 of Ho§ten and Sullivant (2002) gives a 
recursive method for determining the term order and the corresponding Grobner basis 
consisting of primitive moves only. It gives a Grobner basis version of Dobra's Markov 
basis in (4.5). In Theorem 4 we saw that Dobra's construction gives a minimal Markov 
basis only in a special case. The same phenomenon can be observed with respect to 
the reducedness of the Grobner basis if wc simply apply Theorem 4.17 of Ho§ten and 
Sullivant recursively, that is, the operation of Theorem 4.17 of Hostcn and Sullivant does 
not preserve reducedness in general. Here we are interested in an explicit description 
of appropriate term order and the reduced Grobner basis for decomposable models. We 
prove that for decomposable models, there exists a term order such that the reduced 
Grobner basis is explicitly described and, furthermore, it is minimal as a Markov basis. 

In obtaining a nice Grobner basis, the term order has to be carefully chosen. For 
example, consider the simple case of 3 x 3 two-way contingency tables with fixed row 
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sums and column sums. Proposition 5.4 of Sturmfcls (1996) shows that the set of nine 
primitive moves of the form 



± 



+1 


-1 


-1 


+1 



constitute a reduced Grobner basis when the cells are lexicographically ordered and the 
term order is chosen to be the reverse lexicographic term order. However, if we order the 
nine cells as 



1 


8 


6 


4 


2 


9 


7 


5 


3 



and use the lexicographic order, then the reduced Grobner contains the following degree 
three move 






-1 


+1 


+1 





-1 


-1 


+1 






in addition to the nine primitive moves. This example shows that the existence of a 
reduced Grobner basis consisting of primitive moves depends on the choice of a term 
order. 

We need several steps in constructing a nice term order for a decomposable model of 
an m-way contingency table. First, we order m variables. Choose a boundary clique of 
the chordal graph corresponding to the decomposable model and order the variables in 
the boundary cliques as the lowest variables. Then remove the boundary clique from the 
chordal graph, choose a boundary clique from the smaller graph and order the variables 
from the boundary clique as the next lowest variables. By recursively removing boundary 
cliques we obtain an ordering of variables. The resulting order is a perfect elimination 
scheme but has a stronger property. Second, given the order of the variables, we order 
the cells of an m-way contingency table lexicographically. Finally, as the term order >>- 
we use the reverse lexicographic term order. 

Let B n d as in (4.1). In each fiber J 7 ^, b £ B n d, there exists the lowest element n£ with 
respect to the above term order >-. Define 

m gb = (J U 

beB nd ngfb 

Then we have the following theorem. 
Theorem 9. A4 GB is the reduced Grobner basis and it is minimal as a Markov basis. 
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We omit the details of the proof. By generalizing the proof of Proposition 5.4 of 
Sturmfels (1996) we can show that A4 GB is indeed a Grobner basis. Reducedness is 
obvious. Minimality is also obvious from Theorem 3. 

6. Concluding remarks 

In this paper we investigated the structure of degree two fibers of a decomposable model 
and clarified the structure of minimal Markov bases and minimal invariant Markov bases. 
We have also shown that decomposable models possess a Grobner basis that is at the 
same time a minimal Markov basis. 

For future research it is important to investigate structures of degree three fibers, degree 
four fibers, etc. In Takcmura and Aoki (2004) we gave a characterization of minimal 
Markov bases. It shows that minimal Markov bases can be constructed "from below", 
that is, combining moves from fibers of degree 1,2,3,.... Although at the moment the 
construction cannot be implemented as an algorithm, it is important to study fibers of low 
degrees. We see that the study of degree two fibers in this paper led to some interesting 
results. As another example, in Aoki and Takemura (2009) we found some interesting 
degree three fibers in connection to experimental design with three-level factors. 

As mentioned in the Introduction, the results obtained in this paper will provide in- 
sights to some practical models such as subtable sum models (Hara et al. (2009)), models 
for contingency tables with structural zeros (Aoki and Takemura (2005); Rapallo (2006)) 
and Rasch models (e.g., Chen and Small (2005); Zhu et al. (1998); Basturk (2008)) ob- 
tained by imposing some constraints on decomposable models. We will present results 
along this line in a forthcoming manuscript (Hara and Takemura (2009)). 

It is of interest to study effects of structural zeros and observational zeros to Markov 
bases. In this respect in Hara et al. (2009) we have shown that a Markov basis for two- 
way contingency tables with structural zeros can be obtained as a subset of a Markov 
basis for subtable sum models, where the subtable sum happens to be an observational 
zero. 
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